Cancer molecular pattern discovery by subspace consensus kernel classification.

نویسنده

  • Xiaoxu Han
چکیده

Cancer molecular pattern efficient discovery is essential in the molecular diagnostics. The characteristics of the gene/protein expression data are challenging traditional unsupervised classification algorithms. In this work, we describe a subspace consensus kernel clustering algorithm based on the projected gradient nonnegative matrix factorization (PG-NMF). The algorithm is a consensus kernel hierarchical clustering (CKHC) method in the subspace generated by the PG-NMF. It integrates convergence-soundness parts-based learning, subspace and kernel space clustering in the microarray and proteomics data classification. We first integrated subspace methods and kernel methods by following our framework of the input space, subspace and kernel space clustering. We demonstrate more effective classification results from our algorithm by comparison with those of the classic NMF, sparse-NMF classifications and supervised classifications (KNN and SVM) for the four benchmark cancer datasets. Our algorithm can generate a family of classification algorithms in machine learning by selecting different transforms to generate subspaces and different kernel clustering algorithms to cluster data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.

Robust cancer molecular pattern identification from microarray data not only plays an essential role in modern clinic oncology, but also presents a challenge for statistical learning. Although principal component analysis (PCA) is a widely used feature selection algorithm in microarray analysis, its holistic mechanism prevents it from capturing the latent local data structure in the following c...

متن کامل

A Framework for 3D Object Recognition Using the Kernel Constrained Mutual Subspace Method

This paper introduces the kernel constrained mutual subspace method (KCMSM) and provides a new framework for 3D object recognition by applying it to multiple view images. KCMSM is a kernel method for classifying a set of patterns. An input pattern x is mapped into the high-dimensional feature space F via a nonlinear function φ, and the mapped pattern φ(x) is projected onto the kernel generalize...

متن کامل

Consensus Molecular Subtypes of Colorectal Cancer and their Clinical Implications

The colorectal cancer (CRC) subtyping consortium has unified six independent molecular classification systems, based on gene expression data, into a single consensus system with four distinct groups, known as the consensus molecular subtypes (CMS); clinical implications are discussed in this review based on articles relevant to the CMS of CRC indexed in PubMed as well as the authors’ own ...

متن کامل

A Subspace Kernel for Nonlinear Feature Extraction

Kernel based nonlinear Feature Extraction (KFE) or dimensionality reduction is a widely used preprocessing step in pattern classification and data mining tasks. Given a positive definite kernel function, it is well known that the input data are implicitly mapped to a feature space with usually very high dimensionality. The goal of KFE is to find a low dimensional subspace of this feature space,...

متن کامل

Multi-category classification by kernel based nonlinear subspace method

The Kernel based Nonlinear Subspace (KNS) method is proposed for multi-class pattern classi cation. This method consists of the nonlinear transformation of feature spaces de ned by kernel functions and subspace method in transformed high-dimensional spaces. The Support Vector Machine, a nonlinear classi er based on a kernel function technique, shows excellent classi cation performance, however,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational systems bioinformatics. Computational Systems Bioinformatics Conference

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2007